The UPV at GeoCLEF 2007

نویسندگان

  • Davide Buscaldi
  • Paolo Rosso
چکیده

In this work we attempted to determine the relative importance of the geographical and WordNet-extracted terms with respect to the remainder of the query. Our system is based on Lucene and uses LingPipe for Named Entity recognition. Geographical terms are expanded with WordNet holonyms and synonyms and indexed separately. We checked the relative importance of the terms by boosting them with reduction factors (0.75, 0.5 and 0.25). The comparison to the clean system (using only Lucene) shows that it is possible to improve the mean average precision if the importance of geographical terms is equal or less than the half with respect to the content words in the query. We also observed that WordNet holonyms may help in improving the recall but the term expansion is sensible to ambigue place names. As a further work, we will need to implement a toponym disambiguation method in order to reduce the impact of this kind of ambiguity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

TALP at GeoQuery 2007: Linguistic and Geographical Analysis for Query Parsing

This paper describes our experiments on the Geographical Query Parsing pilot-task for English at GeoCLEF 2007. Our system uses some modules of a Geographical Information Retrieval system presented at GeoCLEF 2006 [3] and modified for GeoCLEF 2007. The system uses deep linguistic analysis and Geographical Knowledge to perform the task.

متن کامل

Monolingual Retrieval Experiments with Spatial Restrictions at GeoCLEF 2007

The participation of the University of Hildesheim focused on the monolingual German and English tasks of GeoCLEF 2007. Based on the results of GeoCLEF 2005 and GeoCLEF 2006, the weighting and expansion of geographic named entities (NE) and Blind Relevance Feedback were combined. This year an improved model for German Named Entity Recognition was evaluated.

متن کامل

Mono-and Crosslingual Retrieval Experiments with Spatial Restrictions at GeoCLEF 2007

The participation of the University of Hildesheim focused on the monolingual German and English tasks of GeoCLEF 2007. Based on the results of GeoCLEF 2005 and GeoCLEF 2006, the weighting and expansion of geographic Named Entities (NE) and Blind Relevance Feedback (BRF) were combined and an improved model for German Named Entity Recognition (NER) was evaluated. Post submission experiments are a...

متن کامل

University of Groningen at GeoCLEF 2007

This paper describes the approach of the University of Groningen to GeoCLEF task for CLEF 2007. We used geographic scope based approach to rank documents.

متن کامل

Re-Ranking for Geo-Relevance With Non-Contextual Heuristics at GeoCLEF 2007

Geographic Information Retrieval (GIR) in an attempt to improve relevance by taking geographic information in textual documents into account. We describe out experiments carried out at the GeoCLEF 2007 evaluation [1] that investigate further the role of geo-filtering based re-ranking and query expansion with geographic terms. Our main findings are that manual query expansion with geo-terms is m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007